CDS

Accession Number TCMCG075C29565
gbkey CDS
Protein Id XP_017984738.1
Location complement(join(4396015..4396052,4396146..4396320,4396823..4397017,4397923..4398033,4398120..4398332,4398916..4399023,4399112..4399253,4399340..4399907,4400016..4400880,4401947..4402636))
Gene LOC18586451
GeneID 18586451
Organism Theobroma cacao

Protein

Length 1034aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018129249.1
Definition PREDICTED: pumilio homolog 4 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category J
Description pumilio homolog
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
KEGG_ko ko:K17943        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003676        [VIEW IN EMBL-EBI]
GO:0003723        [VIEW IN EMBL-EBI]
GO:0003729        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0097159        [VIEW IN EMBL-EBI]
GO:1901363        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGTTACAGGCAGTAACATAGATATGCTACCAACTATAGATAATGGTTTAGAAAGACATGGTGGGAATTTGGAAGATAGTTTCACTGAGCTAGAATTGATTTTGCAAGCGCATCGTAATCAACAATTTGTAGGTCGTGAAAGGGATCTTAATATATATAGGAGTGGCAGTGCTCCACCTACAGTTGAGGGATCCTTGAGTGCTGTTGGTAGTCTTTTTGCTAATCCTGATTTTGGAGACATTAATGGCATAACTGCTGTTGCTGGTAGTAGTAGTAGTAGCAATAATGGAATGCTGTCTGAAGATGAGATACGCTCACACCCTGCATATCTTTCATATTATTACTCCCATGAAAACATAAATCCAAGGCTGCCTCCACCGTTGTTATCAAAAGAGGATTGGCGTGTTGCACAAAGGTTTCAGGCTAGTGGGTCTTCCCTTGGGAACATTGGGGACTGGAGAAAGAAGAAGTTGGTTGATGGCGGTGATAGTTCGTCCTTATTTTCAATGCAGCCAGGTCTTTCTGTACAACAAGAACAAAATGATTTGATGGAACTGAGGAATACCAATGCAAGGAATACATCTAGAAAAATGTCAGCTGAGTGGCTTGATAGAGGTTCAGATGGTTTGGTTGGGCTGTCTGGTACTGGGCTTGGTGCAAGGAGGAAGAGTTTTGCTGACATTCTTCAGGATGGACTTGATCGACCTGCCACCTTATCAGGCCATCTCTCACAGCCATCAAGTCGCAATGCTTTTAGTGATATGTTGGATGCAGCTAGCATTGCTGATCCCAGTCCACCAGGTTTTCATAATGCAGCAGAGTCCATAGAGAGCTTGCCTGCTGGGGTAGCTCGTCCAGGTGTGGTAGGAGTTCAGAGCCATGGTAAAACTACTTCTCACTCTTTTGCATCTGCTGTAGGTTCATCATTATCGAGGAGTACAACTCCTGAACCATATTTAGTTGGGAGGTCTTCTGGTTCTGGACTTCCTCCTGTTGGGAGCAAGGTTGGCCATGCAGAAAAAAAGAATATCATTGGATCTAATGTCCAAAATGGGCATTCTTCTGCTGTGACTGAACTTTCTGAAATTGGAGCTACATTATCTGGGTTGACCTTATCGAAAACTAGACATGCAGATGAGAATAGTCATATGCGGTCTCAGCTTCAGGTTGATCTGGATAATCAGCTAGATTTTTCATTCAATATGCCCAATGGTCATAATCAGAGTTTGCAGCAGCAATTCATTGACAAGTCCAGTGCTGAAAAGCTTGCATTTCCTACCAACCATATCGACTTGGCAAGGAAAAAGGGAATTGCACCTAATATTAATGCTTATAATATTAGTTCCAATGGACAAGTCAGCATTCCCAAAAGAACTTCCTCTTCTGCAGATCTTTACGCAAAAGTGCATCCTTCAGGCCTTGGAAGTTTGGAAGTATGTGATGTTGGCCATCCTAATGTGAATCTTGCAAACACAGATTTCATTGGCCAACTACCCAGTGCTTATTCTGTTAACCAGAAGTTGAATTCAGCGATTAAGAACCATTTAAATGCAGGTTCCCCTTTGACTGGTACTGGGGATAGGCAAAGTTTAAATAGAGCTGGAAATCAAGGGGCTGACCTTCTTTCTCCACTTATGGATCCTCGTTATATCCAGTACTTGCAAAGAACTTCTCAGTATGGGGCACGAGCTGCAGCTAGCCCTGATTCTCTGCTTTCTGGGAACTATGTTGGTACTCTGCATGGGGATTTGGATGGCCTTCAAAAAGCATACCTTGAGGCAATATTAGCTCAACAGAAGCAGCAGTATGAACTGCCACTTTTAGGTAAAGCTGCTGCTCTGAATCATGGCTATTATGGGAATCCCTCGTATGGTCTTGGCATGCCGTTTGCTGGAAATTCAATGGCAAATTCTGTACTCCCCTCTATTGGTTCTGGAAGTATACAGAATGATAGAACTGCACGTTTTAATTCAATGATGAGAACCTCAACAGGAGCATGGCCCTCAGATATTGGTAATAATGTGGATGGAAGATTCATATCATCTTTATTAGATGAATTTAAGAACAACAAGACTAGGTGTTTTGAACTCTTAGATATCATTGATCATGTTGTTGAATTCAGTACGGATCAGTATGGTAGTCGCTTTATTCAGCAGAAATTAGAAACTGCCACAGAGGAAGAGAAGACCAAAATATTTCCTGAGATTATTCCCCATGCTCGCGCTTTGATGACTGATGTGTTTGGAAATTATGTCATACAGAAATTTTTTGAGCATGGTACAGAAAGTCAAAGAGCAGAGTTAGCCAGTCAACTTACTGGTCATGTGTTGCCTCTCAGTCTTCAAATGTATGGTTGCAGAGTGATTCAGAAGGCTTTGGAAGTTGTTGGTGTGGATCAGCAGACTGGAATGGTGGCAGAGCTTGATGGTTCAATCATGAAATGTGTTCGTGATCAGAACGGTAATCATGTTATTCAGAAGTGTATAGAGTGTGTCCCTCAGGATCGAATTCTGTTTATCATATCTGCTTTCCATGGCCAAGTTGTCGCTCTTTCTACCCACCCTTATGGTTGTCGTGTCATTCAGAGGGTTCTGGAACATTGCGATGATGTAAAAACCCAACAAATTATTATGGATGAGATCATGCTATCTGTATGCACTCTGGCACAAGATCAATATGGGAACTATGTTATTCAGCATGTTCTTGAACATGGTAAACCACATGAGCGATCTGCTATTATCAGCAAGCTTGCAGGACAAATCGTGAAGATGAGTCAGCAGAAATTCGCTTCTAATGTTGTCGAGAAGTGCTTGACTTTTGGTGGGCCTGAGGAACGTCAAATTTTGGTGAACGAGATGCTTGGTTCTACTGATGAAAATGAGCCATTGCAGGCCATGATGAAAGATCAATTTGGAAACTATGTTGTGCAAAAGGTTCTTGAGACTTGTGATGATCGGAGTCTTGAGTTGATTCTCTCTCGAATCAAGGTACATTTAAATGCCCTGAAGAGGTACACTTACGGCAAACATATTGTTTCACGCGTTGAGAAGCTTATTGCAACTGGAGAAAGGCGCATAGGATTACTTTCGTCATTGGCCGCCTAA
Protein:  
MVTGSNIDMLPTIDNGLERHGGNLEDSFTELELILQAHRNQQFVGRERDLNIYRSGSAPPTVEGSLSAVGSLFANPDFGDINGITAVAGSSSSSNNGMLSEDEIRSHPAYLSYYYSHENINPRLPPPLLSKEDWRVAQRFQASGSSLGNIGDWRKKKLVDGGDSSSLFSMQPGLSVQQEQNDLMELRNTNARNTSRKMSAEWLDRGSDGLVGLSGTGLGARRKSFADILQDGLDRPATLSGHLSQPSSRNAFSDMLDAASIADPSPPGFHNAAESIESLPAGVARPGVVGVQSHGKTTSHSFASAVGSSLSRSTTPEPYLVGRSSGSGLPPVGSKVGHAEKKNIIGSNVQNGHSSAVTELSEIGATLSGLTLSKTRHADENSHMRSQLQVDLDNQLDFSFNMPNGHNQSLQQQFIDKSSAEKLAFPTNHIDLARKKGIAPNINAYNISSNGQVSIPKRTSSSADLYAKVHPSGLGSLEVCDVGHPNVNLANTDFIGQLPSAYSVNQKLNSAIKNHLNAGSPLTGTGDRQSLNRAGNQGADLLSPLMDPRYIQYLQRTSQYGARAAASPDSLLSGNYVGTLHGDLDGLQKAYLEAILAQQKQQYELPLLGKAAALNHGYYGNPSYGLGMPFAGNSMANSVLPSIGSGSIQNDRTARFNSMMRTSTGAWPSDIGNNVDGRFISSLLDEFKNNKTRCFELLDIIDHVVEFSTDQYGSRFIQQKLETATEEEKTKIFPEIIPHARALMTDVFGNYVIQKFFEHGTESQRAELASQLTGHVLPLSLQMYGCRVIQKALEVVGVDQQTGMVAELDGSIMKCVRDQNGNHVIQKCIECVPQDRILFIISAFHGQVVALSTHPYGCRVIQRVLEHCDDVKTQQIIMDEIMLSVCTLAQDQYGNYVIQHVLEHGKPHERSAIISKLAGQIVKMSQQKFASNVVEKCLTFGGPEERQILVNEMLGSTDENEPLQAMMKDQFGNYVVQKVLETCDDRSLELILSRIKVHLNALKRYTYGKHIVSRVEKLIATGERRIGLLSSLAA